The use of machines in the construction of a grammar and computer program for structural analysis

نویسندگان

  • Kenneth E. Harper
  • David G. Hays
چکیده

The present paper describes progress made on the building of a descriptive grammar of Russian with the complementary efforts of man and machine. Linguistic research at the Rand Corporation begins with the collection on punched cards of a large quantity of raw text from Russian physics journals. As described elsewhere in detail, a total of 250,000 running words of text is being processed, in corpora of about 30,000 words each. Post editors supply codes to indicate (a) the structure of the Russian sentence and (b) its translation into English. In this way the relative position of each word in the structure of the whole sentence is recognized and codified. Dependency codes are then punched back into the text cards. The entire corpus is then machine-sorted and listed according to the structural and morphological type of each item in the text, and according to lexical entries. Syntactic analyses of these listings lead to the identification of word classes according to function (the extension and modification of traditional grammatical classifications) and to identification of the relations between syntactic units of the sentence. The word classes and functional relationships thus determined are imbedded in a computer program for sentence-structure determination that is now being tested. The program establishes a relationship between two words in a specific sentence when: (a) the words belong to classes that, in general, can be related, and (b) all intervening words in the sentence have previously been related to one or the other of the words in question. The sum of the word classes and functional relationships that can exist among them is a grammar for Russian physics texts, while the computer program for translation is a working statement of the grammar. The empirical questions now under test are: (a) What word classes and functional relationships are to be recognized for Russian ? (b) Do the computer-determined sentence structures match those given for the same sentences by linguists ? Harper • The use of machines in the construction of a grammar and computer program for structural analysis 189

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Analysis of a Novel Three-phase Axial Flux Switching Permanent Magnet Generator with Overlapping Concentrated Winding

This paper proposes a novel axial flux switching permanent magnet generator for small wind turbine applications. Surface mounted axial flux switching permanent magnet (SMAFSPM) machine is a new type of these machines that is introduced in this paper. One of the most important challenges in optimal designing of this kind of machines, is ease of construction and maintenance. One of the main featu...

متن کامل

Path analysis of occupational injuries based on the structural equation modeling approach: a retrospective study in the construction industry

Background and aims: The construction industry, sites, and projects are the most dangerous industries in terms of the risk of occupational accidents and injuries. Important factors that have led the industry as a health, safety, environment (HSE) high-risk industry in the world can be cited such as continuous changes in construction projects, using a lot of resources, poor working conditions,...

متن کامل

Analysis of Tall Buildings with Bundled Tube System Subjected to Wind and Earthquake loads

At present, the tubular structural systems are mainly used in tall buildings to withstand earthquake loads. Although it is possible to analyse the structure by finite element methods using standard three dimensional programs, the system is generally time-consuming and expensive in the primary design work. In this paper, for the analysis of Framed-Tube systems, a simple method was studied and de...

متن کامل

Analysis of Tall Buildings with Bundled Tube System Subjected to Wind and Earthquake loads

At present, the tubular structural systems are mainly used in tall buildings to withstand earthquake loads. Although it is possible to analyse the structure by finite element methods using standard three dimensional programs, the system is generally time-consuming and expensive in the primary design work. In this paper, for the analysis of Framed-Tube systems, a simple method was studied and de...

متن کامل

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1959